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Abstract 

We  are  concerned  with  the  allocation  of  transmitter  time  and  power  for  randomly 
time  varying  mobile  data  communications.  Time  is  divided  into  small  scheduling  intervals, 
called  slots,  and  information  on  the  channel  rates  for  the  various  users  is  available  at  the 
start  of  the  slot,  when  the  user  selections  are  made.  There  is  a  conflict  between  selecting 
the  user  set  that  can  get  the  most  immediate  data  through  and  helping  users  with  poor 
average  rates.  The  Proportional  Fair  Sharing  method  (PFS)  deals  with  such  conflicts.  In 
[5,  6]  the  convergence  and  basic  qualitative  properties  were  analyzed.  Stochastic  approxi¬ 
mation  results  were  used  to  analyze  the  long  term  properties.  The  paths  of  the  (suitably 
interpolated)  throughputs  converge  to  the  solution  of  an  ODE,  akin  to  a  mean  flow.  The 
ODE  has  a  unique  equilibrium  point.  It  is  asymptotically  stable  and  optimal  in  that  it  is 
the  maximizer  of  a  concave  utility  function.  There  is  a  large  family  of  such  algorithms, 
each  member  corresponding  to  a  concave  utility  function.  The  basic  idea  of  PFS  extends 
to  many  systems  of  current  importance  for  which  it  was  not  originally  intended,  and  a 
variety  of  such  extensions  are  treated  here  to  illustrate  the  possibilities.  One  might  have 
minimal  throughput  constraints,  nonlinear  dependence  of  rate  on  allocated  power,  minimal 
SNR  requirements,  etc.  In  some  recent  applications,  the  number  of  slots  in  a  scheduling 
intervals  is  random,  and  the  length  is  not  known  when  the  selection  is  made.  The  form  of 
the  PFS  rule  is  adapted  to  the  application.  Then  the  basic  results  continue  to  hold.  The 
asymptotic  properties  of  the  ODE  characterize  the  behavior  of  the  algorithm. 


1  Introduction  and  Background 

Consider  the  problem  where  there  are  N  users  wishing  to  transmit  data  from  a  single  base 
station  to  N  mobile  destinations,  and  the  rates  of  transmission  are  randomly  varying. 
Time  is  divided  into  small  scheduling  intervals,  called  slots.  Until  further  notice,  in  each 
slot  one  user  is  chosen  to  transmit.  New  systems  estimate  the  rates  by  estimating  the 
SNR,  by  use  of  a  pilot  signal,  with  a  very  short  delay.  Rayleigh  fading  rates  of  a  few 
tens  of  Hertz  can  be  accommodated.  The  selection  of  the  user  is  based  on  a  balance 
between  the  current  possible  rates  and  “fairness.”  The  Qualcom  algorithm,  proportional 
fair  sharing  (PFS).  performs  this  by  comparing  the  given  rate  for  each  user  with  its 
average  throughput  to  date. 
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The  evolution  of  the  throughput  sequence  under  PFS  can  be  represented  in  a  recur¬ 
sive  manner,  as  a  stochastic  approximation  (SA)  [7]  algorithm,  and  results  of  SA  used  to 
analyze  its  properties.  [5,  6]  dealt  rigorously  with  the  asymptotic  and  qualitative  prop¬ 
erties.  The  sequence  of  suitably  interpolated  throughputs  converges  to  the  solution  of  a 
“mean”  ODE,  which  has  a  unique  and  asymptotically  stable  equilibrium  point  9.  The 
throughput  paths  converge  to  9.  This  in  turn,  gave  other  properties  (e.g,,  quantifying 
the  scheduling  gains  over  simple  TDMA).  The  algorithm  was  optimal  in  that  no  other 
can  achieve  a  larger  long  run  value  of  an  associated  utility  function  than  PFS.  This  paper 
continues  the  work  in  [5,  6].  The  original  algorithm  assigns  only  one  user  at  a  time,  at  full 
power  and  with  no  constraints.  But  the  way  that  fairness  was  defined  and  the  interpreta¬ 
tion  in  terms  of  a  utility  function  can  be  exploited  to  obtain  natural  PFS  algorithms  for 
many  other  types  of  systems  of  current  interest,  with  all  of  the  analytical  results  carrying 
over.  Section  2  reviews  the  past  work,  with  special  emphasis  on  a  property  of  the  system 
known  as  cooperativeness.  Sections  3  and  4  concern  a  selection  of  extensions,  opening 
possibilities  to  a  wider  exploitation  of  the  basic  ideas:  Many  users  might  be  scheduled 
simultaneously;  channel  rates  might  be  nonlinear  functions  of  the  allocated  power;  there 
might  be  constraints.  The  appropriate  form  of  the  PFS  rule  is  developed.  There  are 
applications  where  the  number  of  slots  in  a  scheduling  interval  is  actually  random.  In  all 
these  cases,  the  basic  theorems  carry  over  under  appropriate  conditions.1 


2  The  PFS  Algorithm:  Outline  of  Past  Results 


The  original  PFS  algorithm.  This  section  describes  the  original  PFS  algorithm  and 
some  past  results.  There  are  N  users,  each  with  infinite  backlog,2  sending  data  to  mobiles 
via  a  single  transmitter,  with  the  possible  rates  of  each  varying  randomly.  Users  are 
scheduled  one  at  a  time.  If  user  i  is  selected  in  slot  n,  then  it  transmits  rvn  units  of 
data,  where  {ritn,  n  <  oo}  is  a  random  sequence.  The  end  of  slot  n  is  called  time  n.  One 
definition  of  throughput  up  to  time  n  for  user  i  is3 

n 

9%n  =  Y.rmlm/n,  h,i  =  1  indicates  user  i  chosen  in  slot  /,  (1) 

i=i 

9i.n+ 1  9i  n  T  en  [Iitn+l^i,n+l  ^i,n]  j  ^ n  l/(^  T  1)-  (2) 

An  alternative  definition  of  throughput  discounts  past  values  as 

n 

9\n  =  (1  —  e)n9ei0  +  e^(l  —  e)n~lr^ili^  e  >  0  small  discount  factor,  (3) 

1=1 

where  9en  is  the  discounted  throughput  at  time  n.  In  recursive  form,  (3)  is 


Oln+ 1  =  n. 


+  6 


*i,n+1^2,ri+l 


61 


(4) 


(2)  and  (4)  are  of  the  SA  form  [7] ,  and  the  results  of  SA  were  used  for  their  analysis. 
The  approach  is  able  to  handle  quite  general  queues,  arbitrary  channel  rate  processes, 

Alternative  points  of  view,  which  do  not  seem  suitable  for  the  general  queueing  problem,  but  which 
can  be  useful  if  the  cooperativeness  property  doesn’t  hold,  are  in  [1,  10].  The  conditions  are  stronger: 
for  example,  [1]  seems  to  exclude  common  cases  where  the  channels  are  independent. 

2The  case  where  data  arrives  at  random  and  is  queued  until  transmitted  is  covered  in  [6]. 

3If  desired,  en  in  (2)  can  go  to  zero  more  slowly  than  1/n,  useful  for  iterate  averaging  [5]. 


and  algorithm  formulations.  The  basic  algorithm  and  results  are  typical  of  a  large  family, 
indexed  by  some  concave  utility  function,  and  which  is  actually  maximized  by  the  asso¬ 
ciated  algorithm.  The  set  of  algorithms  allow  a  continuum  of  tradeoffs  between  current 
rate  and  throughput,  and  the  same  advantage  holds  for  the  extensions  to  follow.  In  [6] 
both  (2)  and  (4)  were  treated,  differing  mainly  in  notation.  We  work  only  with  (4); 
identical  results  hold  for  (2). 

Let  di  >  0,  which  can  be  as  small  as  desired.  The  original  PFS  algorithm  chooses  the 
user  at  time  n  maximizing  in 


argmax{ryn+i/[6bn  +  d,] ,  i  <  N} 


(5) 


Definitions  and  outline  of  results  for  (4).  Define  9en  =  {9\n,i  <  N}  and  Rn  = 

{^n,j  <  N}.  The  usual  SA  large  time  analysis  uses  continuous  time  interpolations. 
Define  9e(-)  (components  9f(-),i  <  N )  by  9e(t )  =  9en  for  t  G  [ne,  ne  +  e).  Let  denote  the 
data  {Ri  :  l  <  n}  to  time  n,  and  En  the  expectation  conditioned  on  £n.  The  assumptions 
are  just  examples  of  the  possibilities.  See  [5,  6]  for  weaker  assumptions.  The  last  part 
of  A3  is  used  to  avoid  degeneracy  and  is  unrestrictive.  It  holds  if  when  a  component 
9{  is  small  enough  then  there  is  a  nonzero  chance  that  user  i  will  be  chosen,  no  matter 
what  the  values  of  the  other  components  of  9.  A1-A3  hold  under  Rayleigh  fading.  The 
continuity  in  Al  holds  when  successive  rates  differ  by  some  “randomness”  so  that  small 
changes  in  9  will  change  only  slightly  the  conditional  probability  that  any  user  is  selected. 
(A3)  is  used  to  show  that  the  limit  point  is  unique.  (6)  is  weaker  than  the  law  of  large 
numbers.  We  will  abuse  terminology  by  using  9-L  and  9j  for  the  i-th  and  j-th  components 
of  the  vector  9. 

Al.  For  each  i,n,  the  functions  hi,n(9,^n)  EvTi,n+\I{ri  ore 

continuous.  Let  6  >  0.  Then  in  {9  :  9i  >  6,  i  <  N},  the  continuity  is  uniform  in  n,fn. 

A2.  {Rn,n  <  oo}  is  stationary.  Define  hi{9 )  =  Eril{ri/rj>^di+ei)^dj+ej)^  ( stationary 
expectation).  h(-)  is  continuous.  (In  fact,  A3  implies  that  h(-)  is  Lipschitz  s  [6].)  Also, 


n+m—  1 


™  ^nri,l+ll{ri^i+1/rjji+1>(di+di)/(dj+0j),j^i}  hi(9) 


=  0 


(6) 


l=n 


in  probability.  There  are  small  positive  8,  such  that  hj(9)  >  b\  for  9i  <  8,  i  <  N. 
A3.  Rn  is  defined  on  some  bounded  set  and  has  a  bounded  density. 

The  limiting  process.  The  next  theorem  is  a  standard  result  in  SA.  It  says  that  the 
limit  points  of  (4),  are  contained  in  those  of  the  ODE  (7). 

Theorem  1.  ([7,  Theorems  2.2  and  2.3,  Section  8.2].)  Assume  algorithm  ((4),  (5)),  Al, 
and  A2.  Then  for  any  bounded  set  of  initial  conditions,  any  subsequence  of  9€(q€e  +  -)  has 
a  further  subsequence  that  converges  weakly  to  the  set  of  limit  points  of  the  solution  of 

9i  =  hi{9 )  -  Oi,  i<  N.  (7) 


Comment  on  a  monotonicity  property.  See  [5,  6].  The  proofs  of  Theorems  2-4  use 
a  monotonicity  property  together  with  the  strict  concavity  of  the  log  function,  properties 
that  are  essential  for  the  the  extensions.  For  vectors  X,Y  e  JRN ,  write  A"  >  Y  (resp., 
X  >  Y )  if  Xi  >  Yi  for  all  i  (resp.,  and  also  A"  ^  Y).  The  ODE  (7)  has  an  important 
property,  called  the  K  condition.  A  function  /(•)  is  said  to  satisfy  the  A'-condition  if 
for  any  x,y,i ,  with  x  <  y  and  Xi  =  y fi{x)  <  fi(y).  In  our  case,  f(9)  =  h(9)  —  9,  and 
the  condition  holds.  The  condition  says  simply  the  following:  If  9,L  is  increased,  then  the 


other  users  are  not  less  likely  to  be  chosen.  Systems  satisfying  the  lb-condition  are  said 
to  be  cooperative.  The  condition  implies  the  following  important  monotonicity  theorem. 

Theorem  2.  [9,  Proposition  1.1]  Let  /(•)  be  Lipschitz  continuous  and  assume  the  K- 
condition  for  x  =  f(x).  If  x( 0)  <  y(0)  (resp.,  <),  then  x(f|a;(0))  <  x(t\y(0))  (resp.,  <), 
where  x(t\y )  is  the  solution  under  initial  condition  y. 

The  next  result  says  that  the  throughputs  converge  to  unique  limiting  values 

Theorem  3.  Assume  rule  ((4),  (5))  and  A1-A3.  The  ODE  (7)  is  cooperative  and  the 
limit  point  9  is  unique  and  asymptotically  stable.  9e(t  +  •)  — >  8  as  e  — ►  0  and  t  — >  oo. 
Maximizing  a  utility  function.  Theorem  3  shows  that  there  is  a  unique  asymptotically 
stable  limit  point  9  of  the  ODE  and  algorithm.  The  algorithm  is  also  optimal,  as  follows. 
PFS  is  a  type  of  “ascent”  algorithm  for  the  strictly  concave  utility  function 

tf(0)  =  Elog(4  +  *i)-  (8) 

i 

By  a  first  order  Taylor  expansion, 

,)  -  U(0‘„)  =  e  EfowiWt  -  sy  /[d,  +  +  0(4).  (9) 

i 

Since  Y^ih,n+  i  =  1,  to  maximize  the  Erst  order  term  one  must  choose  by  (5).  The 

forms  of  the  extensions  are  motivated  by  the  maximization  of  the  first  order  term  in  such 
expansions.  Theorem  4  asserts  that  the  (5)  maximizes  U (•)  [5]. 

Theorem  4. 4  Under  A1-A3,  (5)  maximizes  lim nU{9n). 

An  example:  Rayleigh  fading.  Consider  independent  users  with  received  signal  power 
determined  by  stationary  Rayleigh  fading  and  with  additive  noise  of  constant  power.  Let 
the  rates  be  proportional  to  the  SNR,  with  mean  rates  1  //3i,i  <  N.  Then  hf-)  can  be 
evaluated  for  di  =  0  [6]:  0*  =  G(N)/N  ■  1//3*  where  G(N )  =  Y.f=i  1  / j-  This  represents 
a  gain  over  classical  TDMA,  where  9.,  =  1  /{N (5 f\.  This  was  noted  in  [4],  under  stronger 
conditions.  The  fact  that  the  limit  throughput  is  proportional  to  the  mean  possible 
rate  is  a  consequence  of  the  facts  that  the  stationary  distribution  for  Rayleigh  fading 
is  exponential,  the  channels  mutually  independent,  and  the  scheduling  rule  is  (5).  In 
[3],  under  the  same  conditions  it  is  shown  that  all  users  get  the  same  fraction  of  slots, 
asymptotically.  The  proofs  of  these  properties  depend  on  the  uniqueness  of  the  limit. 

3  Extensions 

3.1.  Nonlinear  Dependence  of  Rate  on  Power.  The  Model  and  PFS  Rule.  We 

now  allow  several  users  to  be  scheduled  in  each  slot,  with  a  total  power  u.  Consider  a 
channel  with  white-Gaussian  noise  and  signal  to  noise  ratio  S/N ,  with  an  unrestricted 
signalling  alphabet.  Then  the  channel  capacity  is  log(l  +  ,S'/./V).5  Let  u%  denote  the  power 
assigned  to  user  i  at  time  n,  ayn+1  the  power  attenuation  from  transmitter  to  mobile  i,  and 
Nhn+i  the  receiver  noise  power.  Define  bitn+i  =  ay„+i/ACn+1.  Then  the  corresponding 

4A11  of  the  above  results  can  be  extended  to  algorithms  based  on  any  strictly  concave  utility  function, 
not  just  (8).  This  yields  a  family  of  algorithms  that  allow  different  tradeoffs  between  the  values  of  the 
current  rates  and  throughputs  [5] . 

°One  can  suppose  that  there  are  many  symbol  intervals  per  code  word,  and  many  code  words  per 
slot,  so  that  the  capacity  formulas  can  be  used.  This  formula  is  for  an  unrestricted  signalling  alphabet. 
But,  for  small  SNR  values  (say,  <  4  clb),  it  can  be  well  approximated  even  by  binary  alphabets  [8]. 


channel  capacity  for  user  i  in  slot  n  +  1  is  vrijn+i(u)  =  v  log(l  +  bi^n+iUi)1  where  v  denotes 
the  number  of  symbol  intervals  per  slot.6  Redehne  Qi  to  be  the  throughput  divided  by  v. 
Then  (9)  becomes 


u(K+i)  - U(K) -«JXi  +  M  - njm,n  +  4]  +  0(t2).  (10) 

i 

We  can  suppose  that  the  SNR  determined  from  the  pilot  signal  is  a^n+i/N^n+i,  that  for 
unit  power.  Due  to  the  strict  concavity  in  the  tq,  the  power  might  be  distributed  among 
more  than  one  user.  Using  the  motivation  for  (5)  in  terms  of  the  maximization  of  the 
first  order  increment  in  the  log  utility  function  in  (9),  a  natural  analog  of  (5)  is 

max ^ [log (f  +  bhn+xUi)]/[Qt  +  d,:],  subject  to  J2ui  <  u,  ut  >  0.  (11) 

i  i 

Let  u(9,b)  =  {ui(9,b),i  <  N}  denote  the  maximizer.  Then  (4)  is  replaced  by 


^i  = 


0*+e 


log(l  +  bi>n+1Ui(0en,  bn+ 1))  -  9en 


(12) 


Computing  the  maximizer  u(9,  b )  is  not  difficult,  and  we  now  turn  to  this  problem. 
Finding  the  maximizer  in  (11).  For  simplicity  drop  the  n  and  e,  and  write  bi  n+ 1  as 
bi,  and  9,  for  9i  +  di.  First,  drop  the  constraint  tq  >  0  and  form  the  Lagrangian 


L{K,  Ab  0)  =  £[log(l  +  biUi)\/[9i\  +  / i 

i 


where  /i  is  the  multiplier.  If  bi  =  0,  then  drop  the  ith  summand.  Thus,  we  can  suppose 
that  all  bi  >  0,  9,L  >  0.  Differentiating  with  respect  to  Ui  yields  bi/[(l+biUi)9i\—  fi  —  0,  i  < 
N.  Thus,  we  need  only  solve  b\/[{  1  +  b\Ui)9i]  =  hi/[(  1  +  biUi)9i],  2  <  i  <  N\Y^iui  —  u, 
or,  equivalently, 


9iUi-OiUi  =  [bi0i-bi0i]/bibi  =  Cj,  2  <i<N;  ^tq  =  -u,  (13) 

i 

(13)  has  a  unique  Lipschitz  continuous  solution  u(9,b )  =  {ui(0,b),i  <  N}.  7 

Suppose  that  some  components  of  the  solution  are  negative.  The  fact  that  each 
summand  in  (12)  depends  only  on  a  single  component,  together  with  the  concavity  of 
the  log  function,  implies  that  these  negative  components  will  be  zero  for  the  problem 
where  the  nonnegativity  constraint  is  imposed.  So  drop  these  components  and  resolve 
(13).  Continue  until  all  components  are  nonnegative.  We  can  allow  the  6*  to  be  discrete¬ 
valued;  they  need  not  have  a  density.  The  density  in  (A3)  was  used  only  to  assure  the 
Lipschitz  property  of  h(-)  dynamics.  Here  that  is  assured  by  the  smoothness  of  u(9,b). 

The  limit  ODE  and  convergence  theorems.  Al  A3  are  not  needed,  except  for  the 
last  part  of  A2.  (6)  is  replaced  by  the  following.  Let  bn  denote  {bi}H,i  <  A^}. 

6 We  have  not  included  the  effects  of  mutual  interference  among  the  users.  With  orthogonal  spreading 
sequences,  the  interference  will  be  small.  Also,  in  many  cases,  it  does  not  depend  on  how  the  power  is 
distributed,  only  on  the  total  power.  Then  SIR  replaces  SNR  and  the  results  continue  to  hold.  General 
interference  can  be  handled  in  that  the  theorems  remain  valid.  But  the  computation  of  the  power 
allocation  can  be  quite  hard. 

7If  U(-)  is  replaced  by  any  strictly  concave  utility  function,  then  the  resulting  system  still  has  the 
form  of  (13),  but  with  different  coefficients. 


A4.  For  any  bounded  and  continuous  real-valued  function  /(•),  there  is  a  constant  f: 


2  n-\-m—  1 

lim  —  ^  En  f(bf)  —  f  — *  0,  in  probability. 

l=n 


(14) 


The  limit  ODE  is 

ei  =  1,(0)  -et,  i<  n,  (15) 

where  hfO)  is  defined  by  (14)  for  f(bn )  =  log(l  +  bi,nUi(0,bn)).  The  cooperative  property 
and  Theorems  1,  3,  and  4  holds.  The  proof  that  (15)  is  the  limit  ODE  is  as  in  [5,  6],  as  is 
the  proof  that  there  is  a  unique  equilibrium  point  0  such  that  the  paths  starting  at  small  0 
all  go  to  0.  Similarly,  for  the  proof  that  all  paths  end  up  in  Q(0)  =  {0  :  6  >  6}.  One  only 
needs  to  show  that  paths  starting  in  Q(0 )  must  go  to  an  arbitrarily  small  neighborhood 
of  0,  and  stay  there.  The  details  differ  slightly  from  those  in  [6],  and  we  comment  briefly. 

The  proof  in  [6]  of  the  last  assertion  depended  on  the  fact  that  if  there  is  a  path  of 
the  ODE  with  value  0(t)  at  some  time,  where  0(t)  G  Q(0)  —  {0},  then  U(0(t))  <  0.  Then 
U(-)  can  be  used  as  a  Liapunov  function  (together  with  the  fact  that  all  paths  tend  to 
the  set  Q(0 ))  to  prove  the  final  convergence  and  stability  assertion.  Thus,  suppose  that 

U(0)  =  X)  (hi(0)  -  Of)/ ( d,  +  Of)  >  0,  for  some  0  G  Q(0 )  -  {0}  (16) 


Since  0  >  0  and  0  0,  (16)  implies  that 

J2(hS  -ei)/(di  +  oi)>  o.  (i7) 


Now,  consider  the  algorithm,  started  at  0,  but  with  the  power  allocation  rule  at  time 
n  max,ii)j</v  log(l  +  b^n+\uf) / (d,  +  0*)| .  I.e.,  the  power  allocation  u(0 ,  bn+ 1)  is  used. 
Modulo  an  error  of  order  0(e)t,  (12)  and  the  maximizing  property  of  u(-)  yields 


U(0ft))  -  U(0) 


1  log(l  +  bi^ufOf  bi )) 

_e2-  ^  di  +  fy 


oh 


1=0 


.  [,/v- 1  log^  +  Ki+iui(0,  bi )) 

^  - TTo* - 

i  1=0  “T  vi,l 


°h 


(18) 

where  [ ’t/e ]  =  integer  part  of  t/e,  and  0\  (with  interpolation  06(f)  is  from  (12)  under  (11). 
The  SA  arguments  that  led  to  Theorem  1,  together  with  (18),  imply  that  as  e  — >  0, 


U(0(t))  -  U(0) 


hfOjs))  -  Ofs)  ^  > 
d'i  +  0,(s ) 


rt  hj(0)  ~  0j(s) 
Jo  i  d,  +  Ofs) 


This,  inequality  (17),  and  the  fact  that  0  is  an  equilibrium  point  of  the  ODE,  imply  that 


UW))\,= „  =  £ 


US)  -  Sj 

d i  +  0 , 


=  o  >£ 


hfO )  —  0 , 
d ,  +  0 , 


>  0. 


This  contradiction  to  (16)  implies  that  U(0)  <0  for  0  G  Q(0 )  —  {0}.  Thus  U(0(-\0))  is 
strictly  decreasing  in  Q(0)  —  {0},  which  implies  that  a  path  starting  at  any  0(0)  G  Q(0 ) 
must  end  up  at  0,  completing  the  proof  that  0  is  a  unique  asymptotically  stable  point. 

3.2.  Minimum  Throughput  Requirement.  Now  let  there  be  constraints  on  the 
minimum  throughput  in  that  we  desire  0 ,  >  a,.  For  simplicity,  work  with  the  problem 


of  Section  2.  If  the  current  9t  value  is  less  than  a*  then  user  i  needs  to  be  given  some 
advantage  in  that  it  should  be  selected  even  for  some  rates  that  are  lower  than  what 
(5)  would  require.  But  how  much  advantage  ?  Overall  efficiency  is  still  a  concern, 
and  user  i  should  not  be  selected  if  its  current  rate  is  too  low.  This  reasoning  argues 
against  the  use  of  hard  constraints. s  Consider  the  following  “soft”  approach.  For  each 
i,  let  qi(9i)  be  a  differentiable  penalty  function  which  is  zero  if  9l  >  di,  negative  for 
9i  <  di,  and  whose  derivative  qi,eX9i)  is  non  decreasing  in  9i  as  decreases.  For  example, 
qi(9i)  =  Kt  max{(),  1 9i  —  di\(9i  —  a*)},  Ay  >  0.  Analogously  to  the  role  of  (9)  in  getting 
(5),  let  the  allocation  rule  maximize  the  first  order  increment  of  X)[log(0*  +  di)  +  qi{9i)\,  a 
strictly  concave  function.  [Or  using  whatever  utility  function  replaces  log(-).\  This  yields 
the  rule 

arg  max  {ri>n+1  [l /{9ei  n  +  di)  +  qifi.  (%)]  }  ,  (19) 

The  cooperative  property  holds,  and  Theorems  1,  3,  and  4  hold  under  A1-A3. 

The  use  of  the  penalty  function  allows  flexibility  near  the  boundary,  since  the  choice 
will  depend  on  both  the  current  rates  and  by  how  much  the  constraint  is  violated.  The 
use  of  penalty  functions  does  not  assure  that  the  constraints  will  be  satisfied.  They  might 
not  be  feasible.  But  it  does  give  an  advantage  to  users  whose  throughput  is  less  than 
desired,  and  it  puts  a  price  on  constraint  violations. 

3.3.  Minimal  SNR  Required.  Suppose  that  the  additive  noise  at  the  receivers  is 
white-Gaussian  and  that,  due  to  the  the  decoding  and  error  detection  method,  a  user 
would  not  be  selected  unless  a  minimal  SNR  at  the  receiver  of,  say,  k,  can  be  assured,  and 
several  users  might  be  selected  at  the  same  time.  If  a  user  is  selected,  then  the  coding 
is  based  on  the  SNR  being  k.  Let  bi>n+ 1  =  cq,n+i/A,;n+i  be  as  in  Example  3.1.  Then  if 
user  i  is  selected  we  must  have  6vn+  i ih  >  k  (he.,  tq  >  n/bi,n+ 1).  With  the  minimal  power 
applied  to  user  i,  the  amount  of  data  transmitted  is  a  constant  c*.  (5)  will  not  yield  the 
best  allocation,  since  the  power  required  to  get  the  best  ratio  Ci/[9i  +  d,]  might  yield  a 
greater  increment  in  the  utility  if  it  were  split  among  several  other  users. 

Let  1  =  indicator  function  of  the  event  that  user  i  is  selected  at  n.  Then 

V(K»)  -  W  =  «  -  »tnWl,n  +  d,]  +  0(€2).  (20) 

i 

This  tells  us  that  the  appropriate  PFS  algorithm  is  the  maximization  of 

'X2cilt,n+i/[9ln  +  di],  total  power  constraint  «X!4n+i/^,n+i  <  u.  (21) 

i  i 

The  problem  (21)  is  a  form  of  the  knapsack  problem,  since  the  Jj)Tl+ 1  are  {0,  l}-valued.8 
The  cooperative  property  holds  and,  under  A1-A3,  Theorems  1,  3,  and  4  hold. 

A  simpler  form.  A  computationally  simpler  form  drops  the  integer  restriction  and 
allows  the  Jijn+ 1  be  take  arbitrary  values  in  the  interval  [0,1].  Then  (21)  is  a  linear 
program.  Then  solve  (21)  and  select  user  i  with  the  probability  Ii.n+\-  This  might  violate 
the  instantaneous  total  power  constraint  u,  but  it  can  be  shown  that  the  constraint  will 
hold  on  the  average,  which  is  enough  for  many  applications.9 

8The  throughput  capabilities  of  PFS  depends  not  only  on  the  channels,  but  also  on  the  processing 
capabilities  of  the  scheduler.  These  are  continually  improving.  In  problems  more  complicated  than 
that  of  Section  2,  we  should  expect  that  the  computation  would  be  more  complicated.  While  the  exact 
solution  of  the  general  form  of  the  knapsack  problem  can  take  too  much  time  at  present,  in  practice  one 
would  devise  an  algorithm  yielding  an  acceptable  approximation  for  the  special  problem  at  hand. 

9Again,  the  LP  algorithm  would  be  specialized  so  that  it  efficiently  yields  an  acceptable  solution  for 
the  problem  at  hand. 


3.4.  Random  Length  Scheduling  Intervals.  There  are  recent  applications  (say, 
in  EV-DO  systems)  where  the  number  of  slots  required  by  a  user  selected  at  time  n  is 
random  and  not  known  at  the  time  of  selection.  Consider  the  following  example.  The 
selected  user  transmits  its  data  on  the  current  slot.  If  acknowledgment  is  not  received 
in  the  next  slot,  then  it  will  use  the  next  following  slot  to  transmit  additional  data 
(perhaps  repeating  the  original  transmission,  or  sending  additional  parity  bits).  This 
can  be  repeated  up  to  some  finite  maximum  number  of  times.  Then  the  number  of 
slots  required  by  the  selected  user  is  not  known  beforehand,  and  this  must  be  taken 
into  account  in  the  selection  rule.  One  application  is  where  the  channel  for  some  users 
changes  too  fast  to  be  well  estimated,  so  an  average  SNR  is  used  for  them.  But  over 
any  particular  slot,  the  actual  channel  might  be  good  or  poor,  and  the  scheme  tries  to 
adaptively  account  for  this,  using  feedback  from  the  receiver. 

Under  appropriate  conditions  an  ODE  can  be  obtained,  but  it  will  be  hard  to  analyze. 
A  simple  approximation  to  this  model,  when  one  user  is  selected  for  each  slot,  supposes 
that  the  acknowledgments  (if  any)  come  at  the  beginning  of  the  next  slot.  [I.e,  the  delay  is 
ignored.]  Then,  if  the  nth  selected  user  is  i ,  it  transmits  for  a  bounded  random  number  of 
slots  sein+1 ,  which  can  depend  on  the  current  channel  state  j.  Define  s^+1  =  h,n+  istn+i 
and  t®  =  Ya=\  si-  Let  r^+i  denote  the  amount  of  data  to  be  transmitted  during  the 
interval  [t*,  +  s^+1)  if  user  i  is  selected.  Since  the  time  between  selections  is  random, 

(3)  needs  to  be  modified  by  letting  9En  denote  the  discounted  throughput  vector  at  the 
(n  +  l)st  decision  time,  the  end  of  the  r^th  slot.  The  discounted  form  that  converges  to 
the  average  throughput  per  unit  real  time  (as  e  — >  0)  is 


Otn  =  (1  -  ^ 
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Ii  n  —  1  if  i  —  nth  selection, 


(22) 
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+  0(£2). 


(23) 


Let  j  =  channel  state  at  t®,  when  the  (n  +  l)st  scheduling  decision  is  made.  We  have 


UK+1)  -  U(9 '«)  =  e£  [nK+1kn+ 1  -  sen+ldei  n 


/ 


®i,n  +  di 


+  0(e2). 


(24) 


We  cannot  base  the  (n  +  l)st  decision  on  maximizing  (24),  since  s^+1  is  not  known  at  that 
time.  Suppose  that  the  expectation  of  sein+l,  conditioned  on  both  the  data  to  the  start 
of  the  (n  +  l)th  scheduling  interval  and  ri,T^+1  and  j,  depends  only  on  the  user  choice  i 
and  on  j  at  that  time,  and  write  it  as  Sj(j).  It  is  natural  to  replace  s\n+1  in  (24)  by  its 
conditional  expectation  Si(j).  Then  the  appropriate  PFS  rule  is  (max  over  the  {/i,n+i}) 


max 
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Li, n+l  [ElSl{j)Il,n+ 1]  0\,r 
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Otn  } 
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(25) 

The  choice  of  user  i  determines  Si(j),  the  mean  number  of  slots  until  the  next  selection, 
and  the  discounting  from  decision  time  +;  to  decision  time  r^+1  in  (22).  For  this  reason, 
one  cannot  separate  the  users  and  evaluate  them  one  at  a  time  as  in  (5).  Under  conditions 
similar  to  A1-A2,  Theorem  1  holds  with  the  ODE  being  6i  =  hi{6 )  —  s(6)9i,  i  <  N, 
where  s(9)  denotes  the  mean  number  of  slots  required  per  scheduling  period  when  the 
current  state  is  9 ,  and  hi(9)  is  the  average  throughput  for  user  i  over  a  typical  scheduling 
period.  Since  the  throughput  in  (22)  is  updated  at  the  iterate  times,  (26)  is  in  the  iterate 
time  scale  and  not  the  real  time  scale.  By  a  change  of  variables,  the  ODE  in  real  time 


becomes  0*  =  hi(9)/s(9 )  —  ?'  <  A/”,  where  hi(9)/s(9)  is  the  average  throughput  per 

unit  time  for  user  i  when  the  state  is  9.  If  the  rate  and  interval  length  sequences  are  i.i.d., 
then  the  system  can  be  shown  to  be  cooperative  and  Theorems  3  and  4  hold.  Results 
under  weaker  conditions  are  not  yet  available. 

The  method  used  in  applications  at  this  time  uses  (22),  but  the  decisions  use  the  rule 

arg  max  {e[  T i,T®_ |_1 1 i,n+ 1 

This  rule  is  simpler  than  (25).  But  it  does  not  account  for  the  expected  duration  of  the 
required  time.  Theorem  1  holds  under  conditions  similar  to  (Al)-(A2).  Simulations  for 
the  i.i.d.  case  indicate  that  the  limit  point  is  unique.  There  are  no  associated  optimality 
properties,  since  the  rule  does  not  maximize  an  increment  of  the  utility  function. 

Consider  the  case  where  there  are  two  users  with  i.i.d.  rate  and  interval  processes, 
whose  rate  processes  are  identically  distributed,  but  the  interval  processes  are  not.  Sim¬ 
ulations  imply  that  (26)  gives  each  the  same  limit  throughput.  But,  under  (25),  the  user 
with  the  smaller  mean  interval  has  a  larger  throughput,  as  expected,  and  as  is  consistent 
with  the  concept  of  fair  sharing,  since  its  effective  rate  is  higher.  The  average  through¬ 
put  seems  higher  under  (25).  So  the  new  form  (25)  is  preferable  to  (26).  Little  is  known 
about  the  performance  under  either  rule  when  the  processes  are  correlated.  Then  (25) 
will  in  general  not  be  optimal  and  there  is  room  for  improvement  by  taking  a  view  that 
is  longer  than  one  interval  at  a  time. 


9*1  „  +  di 


(26) 
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